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Recombinant Protein Expression 



The invention relates to methods for increasing the yield of folded recombinant protein in 
host cells. 

All publications, patents and patent applications cited herein are incorporated in full by 
5 reference. 

The overproduction of recombinant proteins in cellular systems jfrequently results in 
misfolding of these proteins. The fates of the misfolded recombinant proteins differ. They 
may refold to the native state or be degraded by the proteolytic machinery of the cell or be 
deposited into biologically inactive large aggregates known as 'inclusion bodies'. 

10 The folding of proteins and the refolding of misfolded soluble and aggregated proteins is 
known to be mediated by a network of evolutionarily conserved protein molecules called 
chaperones (Hartl, F.U., Nature, 381, 571-580, (1996); Horwich, AX., Brooks Low K., 
Fenton, WA., HirshJSeld, LN. & Furtak, K., Cell 74, 909-917 (1993); Ellis, R.J. & 
Hemmingsen, SM., TiBS, 14, 339-342, (1989); Bukau, B., Hesterkamp, T. & Luirink, J., 

15 Trends Cell Biol, 6, 480-486, (1996); Bukau, B., Deuerling, E., Pfimd, C. & Craig, EjV-, 
Cell, 101, 119-122, (2000)). Major chaperones include members of evolutionarily 
conserved protein families, including the Hsp60 family (which includes the bacterial 
chaperone GroEL), the Hsp70 family (which includes the bacterial chaperone DnaK), the 
HsplOO family (which includes the bacterial chaperone ClpB), the Hsp90 family (\^ch 

20 includes the bacterial chaperone HtpG), the bacterial Trigger factor family, and the small 



HSPs (which includes the bacterial proteins IbpA and IbpB). 

Bacterial systems like the gram-negative bacterium Escherichia coli are a popular choice 
for the production of recombinant proteins. In E. coli, it is known that the DnaK and 
GroEL/ES chaperone systems assist the de novo folding of proteins (Hartl, F.U., Nature, 
25 381, 571-580, (1996); Ewalt, Hendrick, J.P., Houry, W,A. & Hartl, F.U. Cell 90, 

491-500 (1997); Bukau, B., Deuerling, E., Pfund, C. & Craig, RA., Cell, 101, 119-122,. 
(2000); Teter, S.A. et al., Ce//, 97, 755-765, (1999); Bukau, B. & Horwich, AX, Cell, 92, 
351-366, (1998); Deuerling, E., Schulze-Specking, A., Tomoyasu, T., Mogk, A. & ^ukau, 
B. Nature 400, 693-696 (1999)). 

30 Furthermore, DnaK and its co-chaperones DnaJ and GrpE are presently considered to forppi 
the most efBcient chaperone system for preventing the aggregation of misfolded proteins 
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(Mogk, A. et al., EMBO J„ 18, 6934-6949, (1999); Tomoyasu, T., Mogk, A., Langen, H., 
Goloubinoif, P. 8c Bukau, B., Mol Mici obiol, 40, 397-413, (2001); Gragerov, A. et al., 
Proc. Natl Acad Set U,S.A, 89, 10341-10344 (1992)). Increased levels of GroEL and its 
co-chaperone GroES have been shown to prevent the heat induced aggregation of proteins 
5 in cells deficient of other major chaperones (Tomoyasu, T., Mogk, A., Langen, H., 
GoloubinoflF, P. & Bukau, B., Mol, Microbiol., 40, 397-413, (2001); Gragerov, A. et al., 
Proc, Natl Acad Sci. USA. 89, 10341-10344 (1992)). 

Moreover, the disaggregation of protein aggregates in E. coli using chaperones has been 
proven for many difTerent cellular proteins in vivo (Mogk, A. et al., EMBO X, 18, 6934- 

10 6949, (1999)), as well as in vitro using thermolabile malate dehydrogenase (MDH) as a 
reporter enzyme (Goloubinoff, P., Mogk, A., Peres Ben Zvi, A., Tomoyasu, T. & Bukau, 
B., Proc. Natl Acad Sci., USA 96, 13732-13737, (1999)). Protein disaggregation is 
achieved by a bi-chaperone system, consisting of ClpB and the DnaK system. Large 
aggregates of MDH could be resolubilised in vitro and MDH was refolded afterwards into 

15 its native structure. Importantly, only the combination of both chaperones is active in 
resolubilisation and refolding of aggregated proteins. A recent publication showed that the 
resolubilisation of recombinant proteins from aggregates in vivo is possible. In these 
experiments, protein aggregates were generated by temperature upshift, and the 
solubilisation and refolding of these proteins was measured in the presence of protein 

20 synthesis inhibitors to ensure that only the pre-existing aggregated proteins were 
monitored. Molecular chaperones were able to resolve the aggregates under these 
conditions. 

Previous studies also indicate that the solubility and yield of recombinant proteins could be 
enhanced by the overproduction of chaperones. Co-oveiproduction of GroEL/GroES 

25 enhanced the solubility of several recombinant proteins synthesised in E. coli (human 
ORP150, human lyso2yme, pSO^'^ protein tyrosine kinase, phoq^homannose isomerase, 
artificial fusion protein PreS2-S'-B-galactosidase) (Amrein, KJB. et al., Proc. Natl Acad 
Sci., USA 92, 1048-1052 (1995); Nishihara, K., Kanemori, M., Yanagi, H. & Yura, T., 
Appl Environ. Microbiol, 66, 884-889 (2000); Thomas, J.G. & Baneyx, F., Mol 

30 Microbiol 21, 1185-1196 (1996); Proudfoot, AJE., GoflFm, L., Payton, MA., Wells, TJSI. 
& Bernard, Ail. Biochem J318, 437-442. (1996); Dale, G.E., Sch5nfeld, H.J., Langen, H. 
& Stieger, M., Protein Engineering, 7, 925-931 (1994)). The overproduction of the DnaK 
system together with recombinant target proteins elevates the solubility of endostatin. 
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human ORP150, transglutaminase and the fusion protein PreS2-S'-B-galactosidase 
(Nishihara, K., Kanemori, M., Yanagi, H. & Yura, T., Appl Environ. Mia'obiol, 66, 884- 
889, (2000); Thomas, J.a & Baneyx, R, J Biol Chem 271, 1114M1147 (1996); 
Yokoyama, K., Kikuchi, Y. & Yasueda, H., BioscL Biotechnol Biochem. 62, 1205-1210 
5 (1998)). 

So far, no systematic approach has been made, to analyse whether the combination of all 
three chaperones systems (DnaK, DnaJ GrpE; GroES, GroEL and ClpB) expressed 
together with target genes in E. coli cells enhances solubility of recombinant proteins. 
Furthermore, none of the above-described studies allows the widespread optimisation of 

10 expression systems that is required to improve yields of soluble proteins on a general level.. 
For example, each of the prior investigations focused on only one or a very small number 
of target proteins. These investigations also focused on the use of only one or two 
combined chaperone systems. In addition, none of these investigations addressed the issue 
of the importance of the ratio of the chaperones to one another and to the recombinant 

15 target protein. The previous studies therefore did not provide any understanding of the 
relationship between different chaperone proteins with respect to the folding/refolding of 
recombinant target proteins. 

Accordingly, there remains a great need in the art for a general method to improve the 
yield of soluble recombinant protein in a given expression system. Such a method would 
20 allow the optimisation of expression systems to give maximal yields of soluble target 
proteins, and be of obvious industrial and commercial benefit. 

The present invention is based upon the systematic engineering of cells for the controlled 
co-overexpression of different combinations of chaperone genes and target genes. In 
addition, it was investigated whether in vivo disaggregation and refolding of recombinant 
25 proteins from aggregates/inclusion bodies could be stimulated by enhanced levels of 
chaperones when the production of the target protein is stopped. As a result, the invention 
provides novel methods of optimising a given expression system in order to achieve higjier 
yields of the desired soluble recombinant protein. 

According a first aspect of the present invention, there is provided a method for the 
30 expression of a recombinant protein of interest, said method compri^g: 

a) culturing a host cell which expresses: 
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i) one or more genes encoding one or more recombinant protein(s) of 
interest; 

ii) at least two genes encoding proteins selected firom the group consisting 
of the chaperone proteins GroEL, GroES, DnaK, DnaJ, GrpE, ClpB and 

5 their homologs (for example, Hspl04, Ydjl and Ssal in yeast); under 

conditions suitable for protein expression; and 

b) separating said recombinant protein of interest from the host cell culture. 

Through the recombinant engineering of host cells in this manner, the invention provides 
novel methods for producing a recombinant protein of interest, which have been found to 
10 lead to significant improvements in the levels of protein produced in the system. The 
mechanism is thought to be through increasing the folding rates of particular proteins using 
the co-expression of particular chaperones in controlled amounts. Using this system, very 
high yields of the desired soluble recombinant proteins of interest can be obtained. 

Any recombinant protein of interest may be produced using the system of the invention. 
15 Preferred examples of proteins of interest will be apparent to the skilled reader. Particularly 
preferred recombinant proteins are those for which it is desirable to produce a large 
amount, and those of conomercial interest. 

Furlheraiore, the invention is readily applicable to a wide range of known expression 
systems by alterations in the cell culture techniques employed. For example, anaerobic 
20 fermenter-based cell culture would be appropriate for the culture of obligate anaerobes, 
whereas standard aerobic cell culture techniques would be appropriate for obligate aerobes. 
The nutrient composition of the culture medium may also be varied in accordance with the 
chosen expression system. The most suitable method of cell culture for a given exixression 
system will be readily apparent to the skilled man. 

25 Preferably, the genes selected in step a) ii) include DnaK, DnaJ and GrpE or homologs 
thereof, and may additionally include ClpB or a homolog thereof. 

In another preferred aspect of the invention, the genes selected in step a) ii) include GroES 
and GroEL or homologs thereof. 

More preferably, the genes selected in step a) ii) include the DnaK, DnaJ, GrpE, ClpB, 
30 GroES and GroEL genes or homologs tiiereof. 
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The above combinations of chaperone proteins have been found to be particularly suitable 
for use in the methods according to the invention. 

According to a further embodiment of the first aspect of the present invention, there is 
provided a method for the expression of a recombinant protein of interest, said method 
5 comprising: 

a) culturing under conditions suitable for protein expression a host cell which 
expresses: 

i) one or more genes encoding one or more recombinant protein(s) of 
int^est; 

10 ii) one or more genes encoding proteins selected firom the group consisting 

of the chaperone proteins GroEL, GroES, DnaK, DnaJ, GrpE, ClpB and 
their homologs (for example, Hspl04, Ydjl and Ssal in yeast); 

iii) one or more genes encoding proteins selected from the group consisting 
of the smdll heatshock proteins of the IbpA family and/or the IbpB 
1 5 family and/or their homologs; and 

b) separating said recombinant protein of interest from the host cell culture. 

The inclusion of a small heatshock protein of the IbpA family and/or the IbpB family with 
one or more of the chaperone proteins GroEL, GroES, DnaK, DnaJ, GrpE, ClpB in a host 
cell with a gene encoding a protein of interest has been shown to bestow significant 
20 beneficial ejffects on the level of expression of the recombinant protein. 

For the purposes of this patent specification, two genes or proteins are said to be 
'homologs' if one of the molecules has a high enough degree of sequence identity or 
similarity to the sequence of the other molecule to infer that the molecules have an 
equivalent fiinction. 'Identity' indicates that at any particular position in the aligned 

25 sequences, the amino acid or nucleic acid residue is identical between the sequences. 
'Similarity' indicates that, at any particular position in the aligned sequences, the amino 
acid residue or nucleic acid residue is of a similar type between the sequences. Degrees of 
identity and similarity can be readily calculated (Computational Molecular Biology, Lesk 
AJM., ed., Oxford University Press, New Yoric, 1988; Biocomputing, Informatics and 

30 Genome Projects, Smith, D.W., ed., academic Press, New York, 1993; Computer Analysis 
of Sequence Data, Part 1, GrifiBn, A-M., and GrifiBn, H.G., eds., Humana Press, New 
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Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 
New Jersey, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds», M 
Stockton Press, New York, 1991). 

The chaperone proteins for use in the invention therefore include natural biological 
5 variants (for example, allelic variants or geographical variations within the species from 
which the genes are derived) and mutants (such as mutants containing nucleic acid residue 
substitutions, insertions or deletions) of the genes. For the purposes of this application, 
greater than 40% identity between two polypeptides is considered to be an indication of 
functional equivalence. Preferred polypeptides have degrees of identity of greater than 
10 70%, 80%, 90%, 95%, 98% or 99%, respectively. It is expected that any protein that 
functions effectively as a chaperone, or as part of a chaperone system, within the host cells 
of the expression system wiU be of value in the described methods. 

Preferably, the levels of the respective chaperone proteins are controlled in conjunction 
with the methods described above. Preferably, the levels of chaperone proteins are 

15 controlled by expressing the genes encoding the respective chaperone proteins from 
different promoters. Preferably, a selection or all of the promoters used are inducible* 
Different promoters may have difTerent strengths and may respond to the same induction 
agent with different kinetics or be responsive to a different induction agent, allowing 
independent control of the expression level of each chaperone protein. Suitable promoters 

20 wUI be apparent to those of skUl in the art and examples are given in standard textbooks, 
including Sambrook et al., 2001 (Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY); Ausubel et al., 1987-1995 (Current 
Protocols in Molecular Biology, Greene Publications and Wiley Intersdence, New York, 
NY). Examples of suitable promoters include IPTG-regulated promoters, such as the PAl 1 

25 and lac-Ol promoters (see Tomoyasu, 2001 ). 

Altematively, or in addition, the respective chaperone proteins are expressed using 
expression systems of different strength. Examples of different expression systems wiU be 
clear to those of skill in the art; discussion of such systems may be found in standard 
textbooks, including Sambrook et al., 2001 (supra) and Ausubel et al., (supra). For 
30 example, the plasmid vector of the expression system may be a high copy number or low 
copy number plasmid. For instance, examples of Kcoli compatible low copy number 
plasmids include pSClOl and pl5A ori. 
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Preferably, the chaperone proteins are over-expressed relative to the expression levels that 
occur naturally in non-recombinant cells. 

Similarly, the invention provides for the levels of the chaperone proteins relative to the 
recombinant protein(s) of interest to be controlled by expressing the genes encoding the 
5 respective proteins firom different promoters, for the reasons described above. For example, 
in a system that utilises an IPTG-inducible promoter for expression of chaperone proteins, 
an arabinose-inducible promoter may be used to control expression of the recombinant 
protein of interest. In addition, the expression of the chaperones and of the recombinant 
proteins(s) can be controlled using diflFerent polymerases. 

10 hi a second aspect, the invention also provides methods comprising the use of a block in 
protein synthesis during the culturing steps a) described above. Preferably, the block in 
protein synthesis is imposed by addition of an effective amount of a protein synthesis 
inhibitor to the culture system, once a desired level of recombinant protein of interest has 
accumulated. More preferably, the chosen protein synthesis inhibitor is chloramphenicol, 

15 tetracycline, gentamydn or streptomycin. In order to ensure that protein synthesis is 
adequately inhibited, an effective amount of a protein synthesis inhibitor should be added. 
Details of effective amounts of protein synthesis inhibitor wiU be apparent to the skilled 
reader and are noted in standard textbooks. For example, for use in prokaryotic host cell 
systems, 200|ig/mL chloramphenicol is effective to inhibit protein synthesis. 

20 Any other method that inhibits protein synthesis may also be of value for use with the 
methods of the invention. This includes the use of mutant strains that are conditionally 
defective in protein synthesis, for example because of the temperature sensitivity of an 
enzyme involved in plasmid or host cell DNA replication or in target gene and host gene 
transcription or in protein translation. The imposition of such a block in protein synthesis 

25 has been foimd to lead to significant increases in the level of recombinant protein that is 
genemted in the system of the invention. 

Alternatively, or in addition, the invention also provides for the use of a reduction in gene 
transcription, by removal of any agents that are effective to induce recombinant protein 
expression (such as IPTG for Lac repressor controlled genes), once a desired level of 
30 recombinant protein of interest has accumulated. Alternatively, a reduction of construct 
transcription could be achieved via the addition of a transcription blocking compound 
(such as glucose for catabolite repressable genes). 
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This aspect of the invention thus provides a method for the expression of a recombinant 
protein of interest, said method comprising: 

a) cnlturing a host cell which expresses: 

i) one or more genes encoding one or more recombinant protein(s) of 
5 interest; 

ii) one or more genes encoding one or more proteins selected from the 
group consisting of the chaperone proteins GroEL, GroES, DnaK, DnaJ, 
GrpE, ClpB and their homologs (for example, Hspl04, Ydjl and Ssal in 
yeast); under conditions suitable for protein expression; 

10 b) imposing a block in protein synthesis, for example by addition of an effective 
amoimt of a protein synthesis inhibitor to the culture system, once a desired level of 
recombinant protein of interest has acciamvdated; and 

c) separating said recombinant protein of interest from the host cell culture. 

Also provided is a method for the expression of a recombinant protein of interest, said 
1 5 method comprising: 

a) culturing a host cell which expresses: 

i) one or more genes encoding one or more recombinant protein(s) of 
interest; 

ii) one or more genes encoding one or more proteins selected from the 
20 group consisting of the chaperone proteins GroEL, GroES, DnaK, 

DnaJ, GrpE, ClpB and their homologs (for example, Hspl04, Ydjl 
and Ssal in yeast); under conditions suitable for protein expression; 

b) imposing a reduction in gene transcription, for example by removal of any agents 
that are effective to induce recombinant protein expression (such as IPTG for Lac 

25 repressor controlled genes), or via the addition of a transcription blocking 

compound (such as glucose for catabolite repressable genes), once a desired level 
of recombinant protein of interest has accumulated; and 

c) separating said recombinant protein of interest from the host cell culture. 

One or more genes encoding proteins selected from the group consisting of the small 
30 heatshock proteins of the IbpA family and/or the IbpB family and/or their homologs may 
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also be included in the host celL The inclusion of such proteins m conjunction with the 
imposition of a reduction in gene transcription or the imposition of a block in protien 
synthesis. 

Preferably, a combination of chaperone proteins is expressed as described above. 

5 Preferably, the chaperone proteins are expressed under a different promoter to that used to 
control expression of the recombinant protein of interest. 

Preferably, the chosen protein synthesis inhibitor is chloramphenicol, tetracycline, 
gentamycin or streptomycin. 

Preferably, in the methods of the above-described aspects of the invention the cultured host 
10 cell is a prokaryotic cell, such as an coli cell, a Lactococcus cell, a Lactobacillus cell or 
a Bacillus subtilis cell, or a eukaryotic cell such as a yeast cell, for example a Pichia or 
Saccharomyces yeast cell, or an insect cell, for example after baculoviral mfection. 

Preferably, an optimised yield of recombinant protein of interest is manifested by 
increasing the level of de novo protein folding. 

15 An optimised yield of said recombinant protein of interest may also be nfianifested by 
increasing tibe level of in vivo refolding of aggregated, or misfolded soluble, recombinant 
protein. 

An optimised yield of said recombinant protein of interest may also be manifested by 
increasing the level of in vitro refolding of aggregated, or misfolded soluble, recombinant 
20 protein. 

An optimised yield of said recombinant protein may also be manifested by increasing the 
level of de novo protein folding in combination with increasing the increased level of in 
vivo refolding and/or in vitro protein refolding. 

Preferably, said increased level of folding or refolding results in increased solubility of the 
25 recombinant protein of interest. 

Preferably, said increased level of folding or refolding results in increased activity of the 
recombinant protein of interest 

According to a third aspect of the present invention there is also provided a method for 
increasing the degree of refolding of a recombinant protein of interest, said method 
30 comprising adding a composition containing a chaperone protein to a preparation of the 
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recombinant protein of interest in vitro. This has been found to increase significantly the 
degree of refolding of protein in preparations containing wholly or partially unfolded 
protein. The preparation of the recombinant protein of interest may be any preparation that 
contains protein that is partially or wholly imfolded or misfolded. Preferably, the 
5 preparation is a cell extract preparation, such as a lysate of a prokaryotic cell. 

Preferably, a combination of chaperone proteins as described above is added to the 
preparation of the recombinant protein of interest. For example, such ch^erone proteins 
may include one or more genes encoding one or more proteins selected firom the group 
consisting of the chaperone proteins GroEL, GroES, DnaK, DnaJ, GrpE, CIpB and their 
10 homologs (for example, Hspl04, Ydjl and Ssal in yeast), and optionally one or more 
genes encoding proteins selected firom the group consisting of the small heatshock proteins 
of the IbpA family and/or the IbpB family and/or their homologs. 

The preparation of the recombinant protein of interest may be a preparation of soluble 
recombinant protein that has been precipitated in vivo, or may be a preparation of in vitro 
15 precipitated recombinant protein (for example, a host cell extract containing the 
recombinant protein aggregate). 

Preferably, said composition containing the chaperone protein(s) is added after removal of 
any agents that are effective to induce soluble recombinant protein e3q>ression (such as 
IPTG for Lac repressor controlled genes) or after addition of a transcription blocking 
20 compound (such as glucose for catabolite repressable genes). 

Preferably, the third aspect of the invention is vised in conjunction with imposing a block in 
protein syndesis, for example by addition of an effective amoimt of a protein synthesis 
inhibitor to the culture system. As described above, chloramphenicol, tetracycline, 
gentamycin and streptomycin are examples of suitable protein synthesis inhibitors. 

25 Preferably, when practising the above-described methods, the time course of refolding and 
the temperature at which refolding occurs is controlled. The time course of refolding and 
temperature at which it occurs are known to have a significant effect on the yield of soluble 
recombinant protein, and are thus an important aspect of a given expression system to be 
optimised for the maximal yield of soluble recombinant protein. 

30 Preferably, when practising the above-described methods, a composition containing a 
protein selected from the group consisting of the small heatshock proteins of the IbpA 
family and/or the IbpB fanuly and/or their homologs is used in conjunction with the 
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chaperone proteins GroEL, QroES, DnaK, DnaJ, GapE, and/or ClpB and/or their 
homologs. 

A further aspect of the present invention relates to methods for the prophylaxis, therapy or 
treatment of diseases in which aggregated proteins are implicated, comprising the 
S administration of the described combinations of chaperone proteins and/or small heatshock 
proteins in sufficient amounts. Such diseases include, but are not limited to diseases in 
which amyloid deposits are implicated, such as late and early onset Alzheimer's disease, 
SAA amyloidosis, hereditary Icelandic syndrome, multiple myeloma, and spongiform 
encephalopathies. 

10 Various aspects and embodiments of the present invention will now be described in more 
detail by way of example. It will be appreciated that modification of detail may be made 
without departing from the scope of the invention. 

Brief description of the Figures 

Figure lA shows chaperone co-overproduction systems tested in E, colL Genes encoding 
15 three different chaperone-systems (GroEL/ES; DnaK, DnaJ, GrpE; and ClpB) were cloned 
in a psdr of low copy number vectors, which are compatible with E. coli (pSClOl and 
pi 5 A ori), carry the lacl^ gene and different resistance markers for selection. Chaperone 
genes are set under the control of IPTG-regulated promoters (PAll/lacOl) for controlled 
expression. Each combination of vector pairs (1 to 5) differs in its combination and level of 
20 chaperone expression. In these strains subsequently a third plasmid encoding a substrate 
protein was introduced. 

Figure IB shows chaperone expression patterns. The chaperone combinations 2 to 5 are 
shown. The lefi hand lane of each pair is loaded with a sample for which expression of the 
recombinant proteins had not been induced. The right hand column for each chaperone 
25 combination shows an IPTG-induced sample. 

Figure 2. Chaperone and target protein co-expression under IPTG control. The target 
proteins Tep4, Btke and Lzip were purified by metal afBnity chromatography after 
transformation in BL21(DE3) cells used as a control (K) and in tibe same strain but co- 
expressing the 5 different chaperone combinations reported in Figure 1. 

30 Figure 3. In vivo induced refolding. Figure 3 A shows the Btke expression level after 
chaperone-induced re-fblding in BL21(DE3) cells used as a control (K) and in the same 



wo 03/057897 PCT/IB03/00299 

12 

strain but co-expressing the 5 different chaperone combinations reported in Figure 1. Cells 
were grown at 30**C, induced with 0.1 mM IPTG, grown overnight, and then either grown 
2 more hours (first lane of each combination) or pelletted, re-suspended in fresh medium 
plus 200ng/mL chloramphenicol and cultured 2 more hours (second lane). Figure 3B 

5 shows optinusadon of the re-folding conditions using the chaperone combination 4 shown 
in Figure 3B. After overnight culture at 20®C the cells were pelletted, resuspended in fresh 
medium and cultured Ih, 2h, 3h, and 4h at 20°C (1 to 4), or Ih and 2h at 3TC (5 and 6) in 
the presence of 200|ig/mL chloramphenicol. For each combination the first lane was 
loaded with the uninduced sample and the second with the treated one. Figure 3C shows 

10 Btke expressed in control (CI) and chaperone combination 4 (C2) cells. Lanes were loaded 
with uninduced samples (K), induced and cultured at 20°C overnight plus two hours at the 
same temperature , pelleted after overnight growth, resuspended in fresh medium plus 
200|ig/mL chloramphenicol and cultured 2 more hours, as in 2 but in the presence of ImM 
IPTG instead of chloramphenicol, resuspended in fresh medium for Ih, 2h, and 4h. The 

15 numbers shown below the gel image indicate the increase factor obtained comparing the 
intensity of the bands to the reference (induced cells without chaperone co-expression). 
Figure 3D shows the effect of growth conditions on re-folding efficiency of Btke. Cells 
were grown overnight at 20*'C (1)1) and at 42°C before inducing the re-folding at 20*'C 
(D2). Lanes were loaded with un-induced samples (K), induced and cultured overnight 

20 plus two hours (1), resuspended in fresh medium plus 2h culture (2), in firesh medium plus 
200 ^g/mL chloramphenicol and cultured 2 more hours (3). Figure 3E shows the re- 
folding efficiecicy of Tep4 e?q>ressed in control (El) and chaperone combination 4 (E2) 
cells. Lanes were loaded with uninduced samples (K), induced and cultured overnight plus 
two hours (1), resuspended in firesh medium plus 2h culture (3), in fresh medium plus 200 

25 |ig/mL chloramphenicol and cultured 2 more hours (4). 

Fig. 4. In vitro re-folding. Figure 4A shows Btke expressed either in control cells (c) or in 
cells co-expressing chaperone combination 3 or 4. 3h after IPTG induction, cells were 
harvested and lysate prepared as described above. Samples containing lOO^g lysate were 
supplemented with lOmM ATP and 3mM PEP and 20ng/ml PK. After mdicated 
30 timepoints, soluble Btke protein was isolated and analysed by SDS-PAGE and Coomassie 
staining. Figure 4B shows the results produced when pellets with insoluble Btke were 
isolated firom control cells. Pellets were suspended in bufTer and where indicated 
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chaperones were added. After 5 min, 2, 4, and 20 h soluble Btke protein was isolated as 
described above and analysed by SDS-PAGE and silver staining. 

Figure 5 shows the results of experiments to test the eJffects of various combinations of 
different sHSPs and HSPs on the refolding of soluble MDH complexes in vitro. 

5 Figure 6 shows the results of experiments to test the effects of different HSP combinations 
on the refolding of soluble a-glucosidase/sHSP 16.6 and citrate synthase/sHSP 16.6 
complexes in vitro. 

Figure 7 shows the results of experiments to test the effects of different HSP combinations 
on the refolding of aggregated luciferase and soluble luciferase/sHSP 16.6 complexes in 
10 vitro 

Figure 8 shows the results of KJE/ClpB-mediated refolding of MDH. The different 16.6 
concentrations present during MDH denaturation are shown as the indicated 16.6/MDH 
ratio. Refolding curves for KJE-mediated refolding of MDH are indicated. Refolding 
curves for refolding of MDH carried out in presence of ClpB/DnaK are differently 
15 coloured. The precise 16.6/MDH ratios during MDH denaturation are indicated to the right 
of the graph and are as follows: green (16.6/MDH ratio=0); light blue (16.6/MDH 
ratio=0.25); brown (16.6/MDH ratio= 0.5); dark blue (16.6/MDH ratio=l); yellow 
(16.6/MDH ratio=2); pink (16.6/MDH ra!io==4). 

Figure 9 shows the results of experiments to determine the effect on protein refolding of 
20 varying the concentmtion of ClpB. 

Figure 10 shows the results of experiments to determine the effects of mutations to the 
il^AB genes and DnaK genes of E. coli. 

Figure 11 shows a comparison between the effects of mutations to the ibpAB and clpB 
genes in E, coli on the thermotolerance of those strains. 

25 Figure 12 shows the results of experiments to detemiine whether IbpA/B protein function 
increases in importance in the presence of reduced levels of DnaK and at elevated 
temperatures. 

Figure 13 shows the results of experiments to determine the levels of protein aggregation 
associated witii heat shock in tdbpAB tsx:lpB double knockout E. coli cells. 
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Figure 14 shows the effect of IpbAB co-expression on the level of soluble target proteins 
produced in E. coli cells. 

Figure 15: Effect of plasmid interactions on the level of the recombinant protein 
expression. A) Recombmant chaperone (K-DnaK, ELS-GroELS, ClpB) accumulation in 

5 bacteria homogenates. B) Accumulation of co-expressed recombinant chaperones and 
target protem GTRl in the homogenates recovered from control (C) and induced (I) 
bacteria. C) Effect of chaperone co-transformation on the not induced (C) and IPTG- 
induced (T) expression of the target protein Btk cloned in pET24d. D) Effect of the co- 
transformation with an empty pDMl vector on the not induced expression of Btk cloned m 

10 pET24d. 

Figure 16: Co-expression of the coil-coiled region of Xklp3A/B. The chains A and B were 
cloned in a polycistronic vector and expressed either in BL21 (DE3) together with the 
recombinant chaperone combmation K+J+E+ClpB+GroELS (+ chap) or in BL21 (DE3) 
pLysS in the presence of 1% glucose (-chap). 

15 Figure 17: Effect of unsynchronised recombinant chaperone expression on the level of 
soluble target recombinant protein. The independent mduction of the chaperones and target 
proteins has been obtained using arabinose-regulated vectors for the target proteins and 
IPTG-inducible vectors for the chaperones. In the figures axe reported the bands 
corresponding to the soluble target protein purified by affmity chromatogmphy from 0.5 

20 mL of bacterial culture. A) Amount of soluble GTRl and coiled-coil XklpB A recovered 
from wild type cells and bacteria co-transformed with different chaperone combinations. 
Bqpression was mduced by 0.2 mM HTG and 1.5 mg/mL arabinose were added 20 min 
later. The samples were collected 3 hours after the IPTG induction. B) Amount of soluble 
GTRl recovered from bacteria co-transforaied with K+J+E+ClpB+GroELS and using 

25 different combinations of time and expression-inducer concentrations. The samples were 
collected 3 hours after the addition of the first inducer and the bands corresponding to 
GroEL are recovered from SDS-gels loaded with the soluble fraction after cell lysis. C) 
Amount of soluble coiled-coil Xklp3B recovered from bacteria co-transfoimed with 
K+J+E+ClpB+GroEL after overnight culture (ON) at 20^C. The replacement of the ON 

30 medium with fresh medium (Fr. Md.), 0.2 mM chloramphenicol (Chlor.) and the 
temperature shift to SO^'C were used to stimulate the in vivo re-folding of the aggregated 
target protein. 
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Examples 

Examples 1-5 below illustrate the materials and methods used to mvestigate the effect of 
co-expressing different chaperone combinations on the yield of a large variety of different 
5 recombinant proteins. 

Example 1 : Construction of Chaperone Vectors 

Plasmids carrying chaperone genes under the control of the IPTG-sensitive promoter 
PAl/lacO-1 were constructed as described (Tomoyasu, T., Mogk, A., Langen, H., 
Goloubinoff, P., Buckau, B., Mol. Microbiol., 40, 397-413, (2001)). Target protein vectors 
10 were delivered to the Protein Expression Unit fix)m different research groups working at 
the European Molecular Biology Laboratory. 

Example 2: Transformation procedure 

Competent BL21 (DE3) and ToplO cells were transformed with the following couples of 
plasmids for selective expression of ch^rone combinations (Fig.lA). The clones for 

15 DnaK, DnaJ, and GrpE were carried by pBB530 and pBB535; the co-expression of DnaK, 
DnaJ, GxpE, and ClpB was regulated by pBB535 and pBB540; GroEL/BS system was 
expressed by pBB528 and pBB541; a large amount of the complete system DnaK, DnaJ, 
GrpE, ClpB, and GroEL/ES was ensured by pBB540 and pBB542; finally, a lower 
expression level of the same chaperone combination was obtained using pBB540 and 

20 pBBSSO. A complete array of single chaperone plasmid transformed cells was also 
prepared as a control. Transformed cells were checked for chaperone expresdon and 
successively made competent. The protease deficient strain BB7333 (MC4100 AclpX, 
AclpF, Alori) was used for transforming the Btkp protein. These strains were also made 
competent and used for a fiirther transformation with the target proteins. 

25 Example 3: Cell cultures 

Single colonies jBrom the transformed cells were used to inject 3 mL of LB medium. Liquid 
cultures were performed initially at 37**C, then transferred to 30*'C and finally transferred 
to 20^C. Using different times of incubation at the higher temperatures it was possible to 
reach the ODeoo of 0.8 at the same tune for all the different cell strains cultured together for 
30 comparative expression assays. Protein expression was performed overnight by inducing 
gene transcription using 0.1 mM IPTG. 1.5 mL of the overnight culture of both IPTG- 
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induced (hereafter termed 'induced) and control bacteria was directly centrrfuged in an 
Eppendorf tube and the pellet frozen and stored at -20''C. Alternatively, the pellet was re- 
suspended in 3 mL of j&esh medium and divided into two aliquots of 1.5 mL, with or 
without the addition of 200 fxg/mL chloramphenicoL After 2h culture at 20**C the cells 
5 were harvested as described before. Inclusion body overproduction was obtained by 
culturing the bacteria at 42°C overnight after induction. Large scale cultures were grown in 
2L flasks using 5 mL of overnight LB pre-culture to inoculate 500 mL of Terrific Broth. 

Example 4: Protein purification and evaluation 

Frozen bacterial pellets were re-su^ended in 350 pL of 20 mM Tris HCl, pH 8.0, 2mM 
10 PMSF, 0.05% Triton X-100, Ifxg/mL DNAase and 1 mg/mL lysozyme and incubated on 
ice for 30 min, with periodic stirring. The suspension was sonicated in water for 5 minutes, 
an aliquot (of homogenate) was stored and the rest was pelleted in a minifiige. An aliquot 
of the supernatant was preserved and the rest was added to 15 pL of pre-washed magnetic 
beads (Qiagen) and incubated ftirtfaer 30 min under agitation before being removed. Beads 
15 were washed 30 min with 20 mM K-phosphate buffer, pH 7.8, 300 mM NaCl, 20 mM 
imidazole, 8% glycerol, 0.2%Triton X-100 and later with PBS buffer plus 0.05% Triton X- 
100. Finally they were boiled in 12 pL SDS sample buffer and the samples loaded for SDS 
PAGE analysis, Tising a Pharmacia minigel system. Proteins were detected after coloration 
with Simply Blue Safestain (Invitrogen) following the manufacturer's instructions and the 
20 gels were recorded using a Umax Astra 4000U scanner. Bands corresponding to the 
proteins were analysed using the public NIH Image 1.62f software. Alternatively, protein 
was eluted from washed beads using 30}xL PBS buffer plus 0.5M imidazole and its relative 
concentration measured following its adsorbance at 280nm. The proper folding was 
evaluated by circular dichroism using a J-710 spectropolarimeter (Jasco). 

25 Example 5: In vitro experiments 

Cells were grown in LB and after inducing the synthesis of either Btke or Btke together 
with GroEL/ES (combination 3) or togetihter with GroEL/ES, DnaK, EtaaJ, GrpE, ClpB 
(combination 4) for 3h with ImM IPTG at 37^C, lysates were prepared as described 
above. For refolding of Btke from inclusion bodies using total lysate, lOmM ATP, 3mM 
30 phosphoenole pyruvate (PEP) and 20 ng/ml pyravate kinase (PK) were added and 
incubated at 20°C. After 5 min, 2, 4, and 20 h soluble material was separated from 
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insoluble fractions by centrifiigation (15 min, 4*'C, lO.OOOOipm) and the soluble fraction 
was used to isolate target protein as described above. 

For resolubilsation of isolated Bike aggregates with exogenous chaperone addition, lOOjig 
of total lysate (isolated from cells with overproduced Btke) was centrifuged for 1 5 min and 
5 pellets were resuspended in 20 mM Tris/HCl, lOOmM KCl and 20 mM MgCl. Chaperones 
were added as indicated and samples incubated at 20^C for 5 min, 2, 4, 20h. Soluble 
material was separated from inclusion bodies by centrifugation and isolated as described. 

Examples 6 to 9 below illustrate the optimisation of chaperone co-expression combinations 
and other experimental variables in order to greatly increase the yield of a large number of 
1 0 diverse recombinant proteins. 

Example 6: Investigation of the Effect of Chaperone Combinations on de novo 
Protein Folding 

Five different combinations of plasmids encoding chaperone systems (GroEL/ES; DnaK, 
DnaJ, GrpE and ClpB) in different combinations and amounts under the control of IPTG 

15 regulated promoters were introduced into BL21 (DE3) cells as illustrated in Fig. lA. The 
degree of chaperone expression was shown to be very high (Fig. IB). These cells were 
subsequently transformed with plasmids expressing substrate proteins in an IPTG 
controlled manner (Fig. lA). Therefore, co-expression of chaperones and target proteins 
was obtained by simultaneous induction of aU the promoters with IPTG. Co-expression of 

20 chaperones together with 50 different target genes was tested. For each target protein, all 
five different chaperone combinations were tested and solubility of the recombinant 
proteins analysed. In summary a higher yield of soluble substrate protein was achieved in 
more than 50% of the tested constructs (see Table 1 below). 

Table 1 shows a list of the proteins used in the survey for analysing the effect of chaperone 
25 co-expression on soluble target protein yield. The table shows the molecular weight of the 
constructs, the original organisms jBx)m \^ch they were cloned, whether they 
corresponded to full length proteins Q?l) or to domains, expressed alone or fused to a 
partner (fus), and their cell localisation (cytoplasm, membrane, nucleus, secreted) in vivo. 
The yield increase factor (IF) induced by the best chaperone combination is reported under 
30 'Chap- IF' and the yield increase factor obtained using the refolding protocol under 
'Refolding IF*. The symbol (/) signifies that the experiment has not yet been done and (!) 
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that protein has been obtained iising constructs that gave no soluble protein when 
expressed in wild type bacteria. 

Table 1: 



Protein 


MW 


Organism 


Features 


Chap. IF 


Refolding IF 


VI I Jvl 


40 kr> 




Fl/cyt 


3 


3 


■Puvp 






domain/cvt 


0 


28 


ApOti 


1 in IrD 


Xx, ScipJCOa 




0 


/ 


AKlpAl 


OZ, KU 


A. laevis 


Ui/IJU dill/ XUSi/ i» jr I 


0 




XklpBl 


4U KL> 


X. laevis 


uouiam/ lus/ cyi 


n 


t 


HbpH 


y isJ-) 


H. sapiens 


uomam/cyi 






TEVprotease 


30 kD 


TEV 


domain 




/ 




50 kD 


H. s^iens 


domain/cyt 


U 


i 
/ 


UCPI 


33 KD 


R. norvegicus 


domain/membr 


r\ 
O 


U 


Transcr Fact 


37 kD 


H. sapiens 


Fl/cyt 


U 


A 

u 


BtKe 


55 kD 


H. sapiens 


domain/cyt 


>l 
4 


AO 
HZ 


XklpA2 


38 kD 


X. laevis 


domain/fus/ cyt 


U 




XldpB2 


35 kD 


X. laevis 


domain/fus/cyt 


0 




XklpAS 


72 kD 


X. laevis 


domain/fus/cyt 


0 




Rolled 


43 kD 


D. melanogaster 


Fl/cyt 


4.5 


A < 


Lzip 


41 kD 


H. sapiens 


Fl/cyt 


t 




lAp 


52 kD 


D. melanogast^ 


Fl/nucl 


0 




Chip 


64 kD 


D. melanogast^ 


Fl/nucI 


0 




dLMO 


37 kD 


D. melanogaster 


Fl/nucl 


0 




m 


57 kD 


R. prowazekii 


FlAnembr 


0 




BtKc 


64 kD 


H. sapiens 


Fl/cyt 


3 




Pho^hK 


29 kD 


H. sapiens 


Fl/cyt 


3 




Compl.Tep3 


47 kD 


A. gambiae 


domain/fus 


4 




CompLTep4 


45 kD 


A. gambiae 


domain/ius 


3.5 




XklpA4 


72 kD 


X. laevis 


domain/fus/cyt 


2.5 


2^ 
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XklnB3 


73 kD 


X. laevis 


domain/fiis/cyt 


2.5 


2.5 


E8R1 


58 kD 


Vaccinia virus 


Fl/membr/fus 


7 


/ 


V^UJJJIJI. J. ^LfJ 


70 kD 


A. fambiae 


domain/fiis/secr 


0 


11 


#~^^«Mv%t "Tan A 






d OTn ain/fti ^ecr 


3.5 


13 


iViaXJr 


7 s ^cT^ 




domain 


3 


/ 


.AKip/\D 


J J JvL/ 




domain/fus/cyt 


0 


19 






V atfwiiijci V ju U9 


Fl/membr/fiis 


5.5 


5.5 


*jwy 


90 kD 


Z mavs 


Fl/membnme 


3 


5 


Mash 




jiiays 


Fl/rvt 


Q 


3 


T>T> A T* 


oo IrFk 


£>. con 


Fl/rvt 




3 




ISJJ 


m^l Slim o Act ^1* 




I 


3 


171 AT 




V <U«wlUla VJU Uo 




0 


0 


Jb>iiC 


An l/ri 
Hf KxJ 


VaCCIDla VITUS 




3.5 


3.5 


IFrenee 






ran 51 1TI / f*vt 


V 


? 


Tepl 


7 KE> 


A. gambiae 


domain/sear 




\i 


Tep2 


1 1 KU 


A. gambiae 


domain/secr 


A 


A 

V 


2Fre]ige 




xJ^ HldaDOgaSlcr 


iiV/lilclLU/ lUd 


0 


2 


Orf-tusion 




A. victoria 


FlflUS/Cyi 


A 

v 


0 




DUKU 


ri. sapiens 






g 




/ZKU 


n.. sapiens 


Pl/fiic 


I 


J 


XWpA+B 


15+17KU 


-rt^. laevis 


uomam/compiex 






Msl3 


14kD 


D. melanogaster 


domain/cyt 


2.5 


2.5 


Mash+Susv 


94+90 kD 


Z. mays 


Fl/complex 


3 


3 


Endostatin 


22 kD 


M. muscuius 


domain/secr 


0 


0 


Kringle 


30 kD 


H. sapiens 


domain/fus 


0 


0 



As can be seen from tbe 'Chap. IF* ratings, soluble target protein yield increased between 
2.5 and 7-fold. Effects of co-expressed chaperones were not limited to a certain type of 
substrate protein. The target proteins tested were representative of several different classes, 
5 including complexes, soluble, membrane-boimd and secreted proteins, full-length, domains 
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and fusion constructs, with a molecular weight spanning from 7.5 to 1 10 kD, expressed in 
the cytoplasm and in the periplasm (Table 1). Moreover, in some cases, like Lzip (see 
Table 1 and also Figure 2), co-expression of chaperones was the only possibility to obtain 
any soluble protein. Evaluation of the 23 positive cases indicated that the most efficient 
5 chaperone combination was the fourth, which expressed all three chaperone systems in 
large amounts, followed by the third, fifth, first and the second. Nevertheless, as is 
demonstmted in the case of LZip transcription factor where chaperone combination 1 
worked far better than the others, any one chsqperone combination is not necessarily 
Qptimal for all target proteins. Thus, despite the systematic approach it was not possible to 

10 infer general rules about the optimal conditions to succeed. No protein class showed better 
results in combination with particular chaperone combinations and no expression vector 
ensured significantly better yields. The only exception was when target proteins were 
cloned in high copy nximber vectors. In such a case no positive result was observed. The 
competition for the protein synthesis machinery could be considered as a reason, since 

15 chaperone expression is inhibited when a target protein was co-expressed and is 
completely prevented in cells harbouring expression vectors with pUC origin (data not 
shown). The results shown in Table 1 clearly demonstrate the very large increases in yield 
possible via the use of the disclosed methods. 

Example 7: Testing the Effect of Co-overexpression of Chaperone Combinations 
20 and Target Proteins on Re-folding of Aggregated Proteins Using 

Chloramphenicol 

In the experiments of Example 5 it was often observed that inclusion bodies accumulated 
even in the presence of overproduced chaperones increasing the amount of soluble 
proteins. A recent paper (Carrio, M. M. and Villaverde, A. FEBS Lett., 489, 29-33 (2001)) 

25 showed that soluble proteins could be recovered in vivo firom inclusion bodies when the 
protein synthesis was blocked by chloramphenicol addition and the whole cellular folding 
machinery became available for precipitated proteins. Therefore, we investigated the 
overexpression of chaperones not only for keeping recombinant proteins soluble but also 
for increasing the re-folding capability of cells. To investigate this further, we co- 

30 overexpressed chaperones and target genes as described before. Subsequently, we stopped 
protein synthesis by the addition of chloramphenicol. Cells were transferred to fresh media, 
incubated at 20'^C and resolubilisation of targets had been analysed at different time points. 
In fact, in the case of Btke the chloramphenicol-induced block of protein synthesis induced 
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a low increment of the soluble recombinant protein in control cells but an impressive 
increase when specific chaperone combinations were co-expressed simultaneously with the 
target gene prior to the translational arrest (Fig. 3A). It is worthy to note that for Btke the 
optimal chaperone combination differed when the soluble protein accumulated during 
5 standard culture conditions and when proteia synthesis has been blocked (Figs. 2 and 3 A). 
The choice of time and temperature conditions durimg re-folding was crucial for optimising 
the result (Fig. 3B). Longer incubation times or higher temperature lowered the amount of 
recovered soluble protein, probably because degradation by proteases takes over re-folding 
activity. As can been sera in Table 1 above, this method of combining chaperone co- 
10 overexpression with the blocking of protein synthesis resulted in a great improvement in 
the yield of recombinant protein in a large number of the combinations tested. 

Examples: Testing the Effect of Co-overexpression of Chaperone Combinations 
and Target Proteins on Re-folding of Aggregated Proteins by Reducing 
Construct Gene Transcription 

15 The protocol used to block protein synthesis, as described in Example 6 above, was 
evaluated by means of experiment. It was found that the original protocol can be simplified 
and that it was not strictly necessary to completely prevent protein synthesis in order to 
induce re-folding, and in fact the cessation of recombinant protein expression by removing 
the induction agent (IPTG) was sujSicient. In this case the target protein could be re-folded 

20 to a level comparable to that obtained in the presence of chloramphenicol but only in the 
presence of the recombinant over-expressed ch^erones (Fig. 3C). For Btke the optimal re- 
folding conditions enabled the recovery of 42-fold more protein than in the standard 
growth conditions using normal BL21 (DE3) cells and the simplified protocol (without 
chloramphenicol) gave an increase factor of 26. We also tried to induce the inclusion body 

25 formation culturing the bacteria at 42^C and starting the re-folding from a higher amount 
of material but the improvement was negligible (Fig. 3D), probably indicating that the 
limiting factor is represented from the folding machinery or from the cellidar degrading 
metabolism. These two factors seem to be somehow connected, as illustrated in the case of 
Tep4. In contrast to Btke this protein was expressed in soluble form at sufficient levels also 

30 at standard culture conditions and chaperone co-expression induced a limited yield 
increase (Fig. 3E). Nevertheless, the suppression of IPTG induction by simple exchange 
with firesh medium boosted tiie accxmaulation of soluble protein in both the strains but only 
the co-expression of recombinant chaperones could ensure the same results when 
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chloramphenicol was added. Generally, we observed that the addition of fresh medium 
alone was more effective than the combination of fresh medium and chloramphenicol in 
strains with wild type chaperone expression. This indicates that the removal of the inducer 
IPTG, and the subsequent cessation of transcription of the target gene, is sufBcient to allow 
5 refolding from inclusion bodies. It was a goal of the inventors to obtain more information 
about the relationship between protein re-folding and degradation by transforming our 
vectors in the protease deficient strain BB7333. However, the inventors were not able to 
raise a siifficient number of bacterial colonies. This finding confirmed the general role of 
proteases in maintaining cell viability (Tomoyasu, T., Mogk, A., Langen, H., GoloubinofF, 
10 P., Buckau, B., Mol. Microbiol., 40, 397-413, (2001)) and suggests that a certain degree of 
protein degradation must be maintained. It is therefore clear from the above example that a 
reduction in recombinant target gene transcription can also allow the refolding of 
aggregated proteins to proceed, leading to greatly improved yields of the soluble 
recombinant protein of interest. 

15 Protein synthesis inhibitors other than chloramphenicol, such as tetracycline, gentamycin 
and streptomycin have been tested with similar effects. 

Example 9: The Effect of Co-overexpression of Oiaperone Combinations and 
Target Proteins on Re-folding of Aggregated Proteins in vitro 

Next, we analysed whether co-expressed chaperones are capable of ^Jiancing the 
20 refolding of target proteins from inclusion bodies in vitro after cell lysis. For that purpose, 
we induced simultaneously synthesis of Btke together with eith^ chaperone combination 3 
or 4. Cells were harvested after induction and total lysates containing inclusion bodies and 
chaperones were isolated. Subsequently an ATP-regenerating system was added to the 
lysates and the soluble protein was purified after 5 min, 2h, 4h and 20h. Lysate containing 
25 the chaperone mixture 4, which was the most efficient during the in vivo refolding of Btke, 
showed already 5 min after the addition of ATP that approximately all Btke could be 
recovered in the soluble fraction. The control lysate, where only Btke was overexpressed, 
and the lysate with enhanced levels of GroEL/EL showed no significant recovery of 
soluble Btke (Fig. 4A). It is therefore clear that co-overexpressed chaperone mixtures 
30 stimulate re-solubilisation of inclusion bodies from bacterial cell lysates. Refolding of Btke 
inclusion bodies was also possible when chaperones were added exogenously to isolated 
aggregates (Figure 4B). However, refolding efficiency was much lower and refolding 
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kinetics much slower, most probably due to the Umited amovmt of added chaperones. This 
example clearly shows that co-expression of chaperones can also increase the yield of 
soluble recombinant protein via an enhancement of the refolding of target proteins from 
aggregates/inclusion bodies in vitro. 

5 The above Examples 1-8 have clearly shown the value of the methods provided by the 
present invention for increasing protein yield. The re-folding protocol applied to the 
chaperone transformed cells allowed even higher yields of soluble protein than the simple 
co-expression with the target proteins in 8 on 17 cases and, importantly, also gave positive 
results also in the case of 8 constructs insensitive to simple co-expression. Takmg all the 

10 results together chaperones had a positive effect on soluble protein accumulation in 68% of 
the cases analysed in our survey. The ratio remains basically the same if all the 50 
constructs are considered (34 positive) or if only the 37 diifferent proteins are taken in 
account (24 positive, 65%). It must be remarked that such a positive result has been 
obtained despite the fact that most of the constructs used in the experiment correspond to 

1 5 sequences difficult to be expressed in a soluble form in bacteria, like membrane-associated 
or secreted proteins, regions not corresponding to structural domains or complexes 
(underlined in Table 1). The advantage of the in vivo disaggregation is that protein 
refolding follows native patterns and, therefore, recovers its native conformation. The 
correct folding of some of the proteins was analysed by purification imtil homogeneity 

20 followed by circidar dichroism analysis, indicating that the proteins had adopted their 
native conformation after refolding. Importantly, the enzymatic activities of the kinases 
BIR and FIOL, the TEV protease and luciferase were also recovered after re-folding (data 
not shown). Larger scale cultures confirmed the trend observed in test cultures, suggesting 
that the disclosed methods are suitable for industrial applications. In summary, the 

25 invention provides not only a method for the production of large amounts of soluble 
recombinant protein, but also a method for the production of large amounts of recombinant 
protein that is correctly folded and furthermore retains the native protein's biological 
activity. 



30 In the following examples 10 and 11, the effect of small heat shock proteins (sHSPs) on 
the yield of soluble recombinant proteins both in vitro and in vivo was investigated. 
Published data had previously shown that members of the chaperone family of small heat 
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shock proteins (sHSPs), such as the E, coli family members IbpA and IbpB (IbpAB), can 
efficiently prevent the aggregation of unfolded proteins, although they were not shown to 
exhibit protein refolding activity. In the present study, refolding of substrates from 
sHSP/substrate complexes is reported to be dependent on an Hsp70 chaperone system 
5 (such as DnaK with its DnaJ and GrpE co-chaperones) in a reaction that can be further 
stimulated by the GroEL and GroES (GroELS) chaperones. 

Example 10: Investigation of the effect of small heat shock proteins on the yield of 
soluble recombinant proteins in vitro 

The refolding of several recombinant proteins from soluble complexes was tested: 
10 Materials and Methods: 

1 pM MDH was denatured in buffer A (50 mM Tris pH 7.5; 150 mM KCl; 20 mM MgCfe) 
for 30 mm at 47°C either in the presence of 6 pM 1 8.1 (pea), or 6 \}M IbpB (E, coli)^ or 4 
^iM 16.6 (Synechocystis sp.). MDH refolding was initiated at SO^'C by adding an ATP 
regenerating system (2 mM ATP; 3 mM PEP; 20 ng/ml pyruvate kinase) and various 
15 chaperone combinations made up from KJE (1 |iM DnaK; OJ. |xM DnaJ; 0.1 |jM GrpE), 
ESL (4 jJiM GroEL; 4 nM GroES) and ClpB (1.5 fiM). The results for these experiments 
are shown in Figure 5. 

Similarly, 1 nM a-glucosidase or 1 pM citrate synthase were denatured in the presence of 
4 nM 16.6 {Synechocystis sp.) in buffer A for 45 min at 50*'C or 47*^0, respectively. 
20 Protein refolding was initiated at 30°C by adding an ATP regenerating system (2 mM 
ATP; 3 mM PEP; 20 ng/ml pyruvate kinase) and various chaperone combinations made up 
from KJE, ESL and ClpB. The results for these experiments are shown in Figure 6. 

Similarly, 100 nM firefly luciferase was denatured in the absence or presence of 0,4 jiM 
16.6 (Synechocystis sp.) in buffer A for 15 min at 43''C. Luciferase refolding was initiated 
25 at 30*^0 by adding an ATP regenerating system (2 mM ATP; 3 mM PEP; 20 ng/ml 
pyruvate kinase) and various chaperone combinations made up from KJE (0.5 jiM DnaK; 
0.1 nM DnaJ; 0.05 |xM GrpE) and ClpB (0.5 \jM), The results for these experiments are 
shown in Figure 7. 

To investigate the effect of the stoichiometry of the sHSPs on the refolding of 
30 sHSP/substrate complexes 1 pM MDH was denatured in buffer A (50 mM Tris pH 7,5; 
150 mM KCl; 20 mM MgCt) for 30 min at 47°C in the presence of varying 16.6 
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concentrations. MDH refolding was initiated at 30®C by adding an ATP regenerating 
system (2 mM ATP; 3 mM PEP; 20 ng/ml pyruvate kinase) and various chaperone 
combinations made up from KJE (1 pM DnaK; 0.2 |iM DnaJ; 0.1 pM GipE) and ClpB (1.5 
|LiM). The results for these experiments are shown in Figure 8. 

5 Experiments were also carried out in which 1 pM MDH was denatured in buffer A (50 
mM Tris pH 7.5; 150 mM KCl; 20 mM MgC^) for 30 min at 47^C in the absence or 
presence of 0.5 pM 16.6. MDH refolding was initiated at 30®C by adding an ATP 
regenerating system (2 mM ATP; 3 mM PEP; 20 ng/ml pyravate kinase) and the DnaK 
system (1 pM DnaK; 0,2 pM DnaJ; 0,1 pM GrpE) in the presence of varying ClpB 
10 concentrations as indicated. The results for these experiments are shown in Figure 9. 

Results: 

All the sHSPs tested formed complexes with heat-denatured protein substrates such as 
malate dehydrogenase (MDH), firefly luciferase and alpha-glucosidase \^ch represented 
small protein aggregates. The data shown in Figure 5 show that ClpB strongly stimulates 

15 the DnaK-dependent refolding of the thermolabile reporter protein malate dehydrogenase 
(MDH) from various soluble sHSP/MDH complexes. This stimulatory effect was verified 
by analysis of the refolding of the substrates firefly luciferase, citrate synthase and a- 
glucosidase from complexes vsdth sHSP 16.6 (shown in Figure 6 and Figure 7). Notably, 
the refolding of substrates by ClpB/DnaK from sHSP/substrate complexes was in general 

20 much faster than refolding from aggregated proteins generated by identical denaturation 
conditions in the absence of sHSPs (Figure 7). The GroESL chaperone system was not able 
to refold any of the substrates tested from sHSP/substrate complexes, even in the presence 
of ClpB. However GroESL was observed to increase the rates of substrate refolding in the 
presence of DnaK or ClpB/DnaK, especially in case of MDH (Figure 5). Table 2 provides 

25 a summary of the results from these experiments: 
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Table 2: Refolding of thermolabile proteins from protein aggregates or soluble 
sHsp/protein complexes 



Substrate 


Chaperones 


KJE 


KJE/ESL 


KJE/ClpB 


KJE/CIpB/ESL 


aggr. MDH 


0.1 


0.2 


10.3 


25.1 


sHsp/MDH 


4.0 


9.9 


8.5 


27.5 


aggr. a-glucosidase 


0 


0 


1.73 


2.27 


sHsn/ot-fflucosidase 


0.44 


0.53 


2.69 


3.63 


on err oilTAtf* QvniTijiQf^ 


0 


0 


0.06 


0.1 


sHsp/citrate synthase 


0.12 


0.22 


0.4 


0.63 


aggr. luciferase 


0.01 


n.d. 


0.14 


n.d. 


sHsp/luciferase 


0.17 


n.d. 


0.48 


ii.d. 




Refolding rate (nM/min) 


MDH, a-glucosidase, citrate synthase and luciferase were denatured in the 
absence or presence of a 4-fold excess of 16.6. Substrate refolding was 
initiated by addition of an ATP-regenerating system and the indicated 
chaperone combinations (experimental details as desoibed above). Maximal 
rates of substrate refolding were derived firom the linear phase of the time 
curves of recovered enzymatic activity. 



5 On the basis of these results, we propose that sHSP/substrate complexes represent small 
protein aggregates and refolding of substrates from such complexes relies on a 
disaggregation reaction mediated by the DnaK system alone, or much more eflRciently by 
ClpB with the DnaK system. After their active extraction from the complex, unfolded 
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substrates are subsequently refolded by a chaperone network formed by the DnaK and 
GroESL systems. 

Jn vivo the levels of sHSPs are often not sufBcient to prevent protein aggregation and 
sHSPs are usually found associated with protein aggregates. We investigated whether the 
5 presence of sHSPs in protein aggregates can facilitate their resolubilization and 
consequently increase substrate refolding. To answer this question the amount of sHSPs 
utilised in each experiment was titrated during the denaturation of MDH and the resulting 
consequences on DnaK or DnaK/ClpB-mediated MDH refolding were investigated. 
Substoichiometric concentrations of Hspl6.6 compared to MDH resulted in the formation 
10 of insoluble, turbid sHSP/MDH complexes which were, however, much smaller than MDH 
aggregates formed by denaturation in the absence of Hspl6.6 (Table 3). 



Table 3: Characterisation of 16.6/IVIDH complexes 









Size determination 








Dynamic 
Ughtscattering 


Static 
lightscattering 


16.6/MDH 
Ratio 


Lightscattering 
intensity (%) 


SolubiUty (%) 


Calculated 
radius (nm) 


Mass (Da) 


0 


100 


<10 


45 +/- 15 


n.d. 


0.25 


68 


<10 


33.7+/- 12.5 


1.8E+07-7.0E07 


0.5 


37 


18 


31.5+/- 9 


1.8E+07-7.0E+07 


1 


0 


57 


24 +/- 6 


5.6E+07 - 1.5E+07 


2 


0 


84 


19+/- 5 


2.3E+06 - 4.0E+06 


4 


0 


92 


14+/- 5 


1.5E+06-3.1E+06 


1 MDH was denatured in buffer A (50 mM Tris pH 7,5; 150 mM KCl; 20 mM 
MgCh) for 30 min at 47°C in the presence of varying 16.6 concentrations, given as 
16.6/MDH ratio. Turbidity (light scattering intensity) of formed MDH aggregates was 
set at 100%. Solubility of native, untreated MDH after centrifugation (13.000 rpm, 15 
min, 4°C) was set 100%. Size of the different sHSP/substrate complexes were 
determined either by dynamic or static Ughtscattering (coupled to gelfiltation) 
measurements. Both techniques were utilised in case of poorly soluble sHSP/MDH 
complexes leading to characterization of a subpopuladon of the complexes only. 
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Increasing Hspl6.6 concentrations increased the solubility and decreased turbidity and size 
of sHSP/MDH complexes (Table 3). Efficient DnaK-dependent MDH refolding required 
the presence of soluble sHSP/MDH complexes created in the presence of high Hspl6.6 
concentrations (Figure 8). In contrast ClpB/DnaK mediated MDH refolding did not show 

5 up such a severe dependency, however MDH activity was recovered at earlier timepoints if 
insoluble sHSP/MDH complexes mstead of MDH aggregates were used as starting 
material. This effect became much more severe, if the disaggregation potential of the 
ClpB/DnaK system was reduced by lowering the ClpB concentration OPig™® 9). The 
stimulatory effects described above were again observed vAicn substoichiometric 

10 concentrations of sHSPs were present during substrate denaturation (by heat), resulting in 
the formation of insoluble sHSP/substrate complexes. Thus the presence of sHSPs in 
insoluble protein aggregates can significantly fecilitate aggregate resolublization by 
ClpB/DnaK. 

The above example illustrates that refolding of substrates after their ClpB/DnaK mediated 
15 extraction fix>m sHSP/substrate complexes is in most cases stimulated by the GroESL 
chaperone system, indicating that released, imfolded substrates are refolded by a chaperone 
network. We conclude that sHSP function is coiQ>led to ClpB/DnaK dependent protein 
disaggregation and serves to prepare protein aggregates for faster resolubilization. 

20 Example 11: Investigation of the effect of small heat shock proteins on the yield of 
soluble recombmant proteins in vivo 

Materials & Methods: 

E, coU wild type or AibpAB or AdhaJf mutant cells were grown at 30^C to logarithmic 
phase and shifted to 45'*C for 30 min, followed by a recovery phase at BO^'C for 60 min. 
25 Protein aggregates were isolated at the indicated timepoints and analyzed by SDS-PAGE. 
The results for these experiments are shown in Figure 10- 

E, coli wild type or AibpAB or AclpB or AibpAB AclpB double mutant strains were grown 
at SO^'C to logarithmic phase. Cells were either shifted directly to 50°C or were 
preincubated at 42°C for 15 min. Various dilutions of stressed cells were plated on LB 
30 plates. After 18 h colony numbers were counted and survival rates were calculated in 
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relation to determined cell numbers before 50°C shock. The results for these experiments 
are shown in Figure 1 1 . 

. Various dilutions (10"^ to 10"^ of the cultures were spotted on LB plates supplemented 
with the indicated IPTG concentrations and incubated at 30**C, 37^C or 42°C for 18 h. The 
5 results for these experiments are shown in Figure 12. 

Various strains of i?. coli were grown overnight at SC'C in the presence of 500 yM IPTG. 
Cultures were washed twice with LB and inocxilated for further growth at 30°C in the 
presence of various IPTG-concentrations (0, 25, 50, 100 |jM) to logarithmic phase and 
shifted to 42^C for 30 min. Protein aggregates were isolated at the indicated timepoints and 
10 analyzed by SDS-PAGE. The results for these experiments are shown in Figure 13. 

In the experiments described above in examples 6-9 we expressed in E. coli strain 
BL21(DE3) several target proteins mcluding 2C18, E8R, Tep3 and Kringle with or without 
co-expression of difiorent combinations of tfie chaporones GroELS, ClpB, DnaK, DnaJ and 
GrpE. The chaperone combination which for each case yielded the highest levels of soluble 

15 target proteins was taken as "control" (overproduction of KJE/ELS/B for 2C18, Tep3, no 
chaperone overproduction for E8R and Kringle). To show the solubilization effects of 
overproduction of IbpA/IbpB together with other chaperones we genemted BL21(DE3) 
strains which carry plasmids expressing IPTG-regulatable genes encoding ttiese same 
target proteins and in addition plasnuds expressing IPTG regulatable genes encoding 

20 IbpA/IbpB (lanes marked IbpAB in Figure 14), IbpA/IbpB and GroELS (lanes marked 
IbpAB+GroELS in Figure 14), IbpA/IbpB and GroELS and DnaK/DnaJ/GrpE and ClpB 
Oanes marked IbpAB+compL in Figure 14). After IPTG induction the bacteria were 
cultured overnight at 20''C and directly collected (I), or the IPTG was removed and the 
pellet re-suspended in fresh medium and cultured for two additional hours without (N) or 

25 with 200 }ig/ml of chloramphenicol (C). For each combination the amount of soluble 
protein (after affinity purification of the target proteins in the soluble cell fractions) was 
identified on Coomassie-stained SDS-gels. The results for these experiments are shown in 
Figure 14. 

Results: 

30 K coli mutant cells missing the sHSPs IbpA/B do not exhibit a temperature-dependent 
growth pheno^ye (42*^0). However, we observed that the resolubilization of protein 
aggregates, created by severe heat treatment (45''C), was delayed in comparison to wild 
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type cells (Figure 10). Additionally the sxirvival rate (thermotolerance) of AibpAB mutants 
at lethal temperatures (SO^^C) was slightly reduced compared to wild type (Figure 11). 
Thermotolerance is linked to the ability of cells to rescue aggregated proteins and 
consequently the observed reduced thermotolerance of AibpAB mutants is likely caused by 
5 a less efiBcient resolubilization of protein aggregates. 

DnaK has been shown to be the major player in preventing protein aggregation in E. coli at 
high temperatures. We therefore investigated whether IbpA/B fimction could become more 
important in the presence of reduced DnaK levels, rendering E. coli cells more sensitive to 
protein aggregation. In vivo depletion of DnaK was achieved by replacing the a32- 

10 dependent promotor of the dnoKJ operon by an IPTG-inducible one. Reduced DnaK levels 
caused synthetic lethality in AibpAB mutant cells at elevated temperatures (37-42''C). The 
same experiments performed in a AclpB mutant strain and a AibpAB AclpB double 
knockout revealed an increasing necessity for higher DnaK levels at elevated temperatures 
(Figure 12). Especially in case of the AibpAB AclpB double knockout mutant strain this 

15 phenotype was linked to severe protein aggregation upon heat shock to 42''C (Figure 13), 
Thus in vivo IbpA/B is necessary for efficient protein disaggregation, especially under 
conditions which favour protein aggregation and lower the disaggregation potential of 
cells. 

As shown in Figure 14 and Table 4, the combined overproduction of IbpAB v\dth ClpB, the 
20 DnaK system and the GroEL system, and witit combinations of these ch^erones, increases 
the yield of soluble recombinant protein produced in E, coli cells. 
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Table 4: 



Protein 


MW 


Organism 


Features 


IpbAB IF 


SerprotAsl 




A. gambiae 


domain/fiis 


! 


Krin^e 


30 kD 


H. sapiens 


domain/fiis 


J 


2C18 


50kD 


H. sapiens 


pyfus 


2.5 


22j21 


72kD 


H. sapiens 


Fl/fiis 


0 


Tep3 


70 kD 


A. gambiae 


domain/fus/secr 


3^5 


Tep4 


68 kD 


A. gambiae 


domain/fiis/secr 


0 


XklpA3 


73 kD 


X. laevis 


domain/fiis/cyt 


0 


E8R1 


58 kD 


Vaccinia virus 


Fl/membr/fiis 


3.5 


BtKe 


55 kD 


H. sapiens 


domain/cyt 


0 


Nine proteins were tested for the effects of IpbAB co-expression on the level of 
soluble target proteins produced in E. coli cells. The increment j&ctor (IF) defines 
the fold increase (in the best condition, being either I, N or C; see above for 
definition) in amount of soluble protein due to IpbAB co-expression with respect 
to the controls (the best conditions identified firom examples 6-9). 1 denotes that 
the IpbAB-dependent expression of soluble proteins occurred which could not be 
produced in soluble form before. Thus, in 5 of the nine cases tested, the 
overproduction of IbpA/IbpB further increasesd the yield of target proteins. 



Thus, these in vivo data are consistent with the results obtained in vitro. Firstly, the yields 
of soluble recombinant protein produced in £1 coli ceUs can be increased in several cases 
5 tested vdien IbpA/IbpB is overproduced alone or together with various combinations of the 
DnaK and GroELS systems and ClpB. Secondly, E. coli AibpAB mutant cells missing 
IbpA/B exhibited a delayed protein disaggregation after heat shock (45*^0) and a reduced 
survival rate at lethal temperatures (SO^C) compared to wild type cells. IbpA/B fimction 
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became essential at elevated temperatures (37-42*'C) in the presence of reduced DnaK 
levels, conditions which favour protein aggregation and reduce the disaggregation potential 
of the cells. 

In summary, the above Examples 10 and 11 show that small heat shock proteins (sHSPs) 

5 co-operate with other chaperones, in particular with the ClpB chaperone, the DnaK 
chaperone system and the GroEL chaperone system, to solubilize and refold aggregation- 
prone proteins. This property can be exploited to increase the yield of soluble recombinant 
proteins produced in E. coli and other cells, and can be used for the in vitro production of 
soluble recombinant protein. In particular, the combined overproduction of IbpAB with 

10 ClpB, the DnaK system and the GroEL system, and with combinations of these 
chaperones, increases the yield of soluble recombinant protein produced in E. coli cells. 
However, the teaching provided by these experiments is of much broader importance since 
all the proteins involved in this folding reaction are members of large protein families with 
members among prokaryotes and eukaryotes (TbpA and IbpB are members of the family of 

15 sHSPs which includes alpha-cristallins; ClpB is member of the AAA protem family which 
include Hspl04; DnaK is member of the Hsp70 family; DnaJ is member of the DnaJ 
(Hsp40) family; GrpE is member of the GrpE family; GroEL is member of the Hsp60 
family; GroES is member of the GroES family). It is expected that the other members of 
the involved protein families can substitute for the E, coli members in protein folding 

20 reactions. In fact, we present biochemical data that the sHSP of Synechocystis, Hspl6.6, 
can inarease the efficiency of protein refolding in co-operation with the E. coli chaperones 
ClpB, DnaK, DnaJ, GrpE, and GroELS. Furthermore, since ClpB is a homolog of the S. 
cerevisiae Hspl04, a chaperone implicated in the generation and prevention of formation 
of amyloid fiber formation, it is also possible that our finding that the sHSPs co-operate 

25 with ClpB and the DnaK and GroEL systems in protein folding has implications on the 
fomiation or treatment of amyloid fib^s in eukaryotic cells, and diseases in which such 
fibers are implicated. 

Finally, it was found that the IbpA/B, ClpB and the DnaK systems act cooperatively to 
reverse protein aggregation. 

30 To elucidate the functional interplay between IbpA/B, KJE and ClpB in the protein quality 
control network more precisely, we determined the degree of protein aggregation in 
AibpAB, AclpB and AibpAB AcIpB mutants that have KJ adjusted to various levels. Since 
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ClpB and IbpA/B do not prevent protein aggregation in vivo (Mogk et al., 1999), increased 
amounts of aggregated proteins in the respective mutants would indicate a less efficient 
protein disaggregation. At 30°C no protein aggregation was detectable for all tested mutant 
strains, even in cells with greatly reduced KJ levels. After a 30 min incubation at 42^*0, 5% 

5 of cellular proteins aggregated in all mutant cells, provided that IPTG was omitted from 
the growth medium. Increasing KJ levels (by addition of IPTG) reduced the amount of 
aggregated proteins in each strain, but to different degrees dependent on the mutant 
background. While 50 pM IPTG in the growtti medium was sufficient to eliminate 
aggregates in AibpAB cells, 2% and 5% of total proteins still aggregated in AclpB and 

10 AibpAB AclpB mutant cells, respectively. Even in presence of DnaK/DnaJ levels 
corresponding to heat shock conditions (100 pM IPTG), 2% of ceUular proteins remained 
aggregated in AibpAB AclpB mutant cells. 

These findings are in complete agreement vsdth the hierarchial complementation of growth 
defects of these mutant cells at high temperatures and demonstrate the cooperative action 
1 5 of IbpA/B and ClpB in the KJE-mediated removal of protein aggregates in vivo. 



Example 12: Bacteria co-transformed with recombinant proteins and chaperones 
cloned in independent plasmids are suitable for expression tuning 

This example describes a system based on three vectors, where two are under IPTG 
20 regulation and enable the recombinant expression of six chaperones, and the third one is 
arabinose-indudble and harbours the sequence for the recombinant target protein of 
interest. In such a way, the independent induction and the level of expression of both 
chaperones and target protein was possible. The data showed that the expression leakage 
from pET vectors was prevented by the introduction of further plasmids in the cell and that 
25 the recombinant proteins compete for their expression. In fact, the high rate induction of 
one of them could switch off the accumidation of the other recombinant proteins. The first 
information was used to maximise the expression of toxic proteins while the cross- 
inhibition among recombinant proteins was exploited to modulate and optimise the target 
protein expression and to induce the chaperone-assisted in vivo re-folding of aggregated 
30 target protein. 

CloniQg and transformation procedures. 
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Chaperone proteins were expressed as described above. For expression of target protein, 
the sequences corresponding to GTRl (O 00582) and the motor regions of XklpSA and 
Xklp3B (AJ 31 1602; CAA 08879) were cloned in pTrcHis vector (trc promoter and ColEl 
replication origin), Tep3 (unpublished sequence from A. gambiae) was cloned in pGEX 
5 (tac promoter and pBR322 replication origin) and E8R (NP 063710) in pGAT (lac 
promoter and pUC replication origin). The sequences for GTRl, Tep3, E8R, the Xklp3A 
and B C-terminal regions of the coil-coiled domains and a domain of Btk (O 06187) were 
cloned in pBAD. pET24d and pETM60. The Xklp3A and B C-terminal regions of the coil- 
coiled domains were also cloned in the polycistronic vector pST39 (Tan, 2001). 

10 Cell cultures. 

Single colonies from the transformed cells were used to inject 3 mL of LB medium. Liquid 
cultures were incubated initially at 37°C, successively transferred to SO^'C or 20'*C, 
induced at an ODeoo of 0.8 and grown 3 hours or overnight, respectively. Variations of 
timing and concentration combinations used in the experiments with bacteria hosting both 
1 5 IPTG and arabinose regulated expression vectors are described case by case in the results. 

Protein purification and vield evaluation. 

Frozen bacterial pellets corresponding to 0.5 mL of culture were re-suspended in 350 \xL of 
20 mM Tris HCl, pH 8.0, 2mM PMSF, 0.05% Triton X-100 and 1 mg/mL lisozyme and 
incubated on ice for 30 min, with periodic stirring. The suspension was sonicated in water 

20 for 5 minutes, pelleted in a minifiige, the supematant was added to 20 pL of pre-washed 
Ni-NTA magnetic agarose beads (Qiagen) and incubated ftirther 30 min under agitation 
before being removed. Beads were washed 30 min with 20 mM K-phosphate buffer, pH 
7.8, 300 mM NaCl, 20 mM imidazole, 8% glycerol, 0.2%Triton X-100 and later with PBS 
buffer plus 0,05% Triton X-100. Finally they were boiled m 12 pL SDS sample buffer and 

25 the samples loaded onto a SDS PAGE using a Pharmacia minigel system. Proteins were 
detected after coloration with Simply Blue Safestain (Invitrogen) following the 
manufacturer's instructions and the gels were recorded using a Umax Astra 4000U 
scanner. Proteins were quantified analysing the gel bands with the public domain NIH 
Image program (developed at the U.S. National Institutes of Health and available on the 

30 Internet at http://rsb.info.nih.gov/nih-im^e/). 

Results and Discussion 
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Initially we transformed bacteria with chaperone-^arrying plasmids and in a second step 
these cells were re-transformed with a plasmid harbouring the target protein. The 
expression of all the plasmids was under IPTG regulation. The ceU co-transformation with 
three plasmids selected using different antibiotic resistances induced a 20% decrease of the 
5 cell growth rate; however, succeeded even in the case in which two plasmids (pGAT and 
pBAD) shared the same replication origin pUC. 

Bacteria transformed with two low copy number plasmids derived j&om pDMl and 
harbouring different chaperone genes expressed the corresponding proteins at very high 
level (Fig. 15 A). Nevertheless, the intensity of the bands separated in SDS-gel indicated 

10 that the expression of the target protein GTRl cloned into the pTrcHis vector strongly 
faihibited the chaperone accumulation (compare Fig. 15A and 15B) so that ClpB was no 
more detectable in the bacterial homogenate (Fig. 15B). In contrast, the expression of the 
target protein Btk by the leaking vector pET24d in the absence of the inducer IPTG was 
strongly repressed when a chaperone-containing plasmid was co-transformed in the host 

15 cell (Fig. 15C). These results suggest that there are two independent kinds of interaction 
raising from the presence of different plasmids in the same cell. The first one involves the 
plasmids and is independent from their protein products. In fact, the IPTG-independent 
expression of Btk cloned in a pET24d expression vector was prevented also in the case of 
the co-transformation with an empty pDMl vector (Fig 15D). A recent paper reports that 

20 the introduction of heterologous vectors has been shown to induce stress responses and 
inhibit biomass production m S. cerevisiae even though they were empty or non-induced 
(Gorgens et al, 2001, BiotechnoL Bioeng. 73, 238-245). 

The expression-leakage control obtained by co-transformation with more plasmids at once 
can be useM in the case of the expression of toxic proteins or when the leakage rate is so 

25 high to impair the normal cell function. At least one experi«ice in the frame of this work 
indirectly supports this hypothesis. A polydstronic plasmid (Tan, 2001) has been used for 
expressing a complex between the C-terminal end of the coil-coiled regions of Xklp3 chain 
A and chain B. No colony grew using BL21 (DE3) bacteria when we tried to transform 
them with the polycistronic plasmid. Cells co-transformed with chaperone plasmids were 

30 efBciently transformed with the polycistronic vector and gave colonies. Colonies grew also 
when the polycistronic vector was transformed into pLysS stram cells and 1% glucose was 
added to the growth medium to tightly control any expression leakage. However, the 
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bacterial yield was 60% less (data not shown) and the purified protein decreased of more 
than 80% (Fig. 16). 

Beside the case of plasmid interaction our results seem to indicate that the cell machinery 
involved in the protein production was challenged by the contemporary over-expression of 

5 too many recombinant proteins. The results of Figure ISA and 15B could be interpreted 
either as an overwhelming accumulation of target protein transcripts that inhibits the 
chaperone expression rate or a competition for the RNA polymerase. Such a competition 
has been described in E. coli between metabolic and recombinant genes (Schweder et al, 
2002, AppL Microbiol. Biotechnol. 58, 330-337) while recombinant and cell mRNAs 

10 could compete at transcriptional level in yeast (Gorgens et al, 2001, BiotechnoL Bioeng. 
73, 238-245). We observed that in case of co-transformation the effect of competition 
seems proportional to the estimated copy number of the target protein plasmid and 
independent on the promoter used (data not shown). In fact, recombinant vectors hosting 
the target protein with both T7 and lac promoters could inhibit the chaperone expression. 

15 Therefore, a competition at the transcriptional level would be ruled out in our system. The 
existence of a limit of total protein expression can have important consequences in the case 
in which recombinant chaperones are co-transformed to boast the production of a target 
protein. In fact, a too high level of expression of the latter could automatically inhibit the 
chaperone expression levels and, therefore, limit or prevent their positive folding effect. 

20 An alternative method has been envisaged in which chaperones and target proteins were 
cloned in vectors in which their expression was under difTerent regulation systems. This 
enables the indq^endent induction of chaperone and target protein expression and would 
allow exploitation of the chaperone-dependent folding improvement of the target protein 
avoiding any shortcomings due to contemporary co-expression. A logical approach seemed 

25 to induce the accumulation of the chaperones and then trigg^ the target protein ejqpression 
in a cell with boasted folding machinery. 

In a first set of experiments the GroELS chaperones were expressed by means of an 
arabinose-regulated vector (Castanie et al, 1997, Anal. Biochem. 254, 150-152) and the 
IPTG-dependent target proteins were induced after 30 minutes. The results did not show a 
30 significant increase of soluble target proteins and no improvement was detected varying 
incubation times and inducer concentrations (data not shown). 
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In a second attempt the target proteins were cloned into arabinose-regulated vectors while 
five different IPTG-dependent chaperone combinations were compared. Such an 
expression system mostly resulted in an increased yield of the soluble target proteins (see 
Table 5). 

5 



Protein 


MW 


Organism 


Improvement Factor 


GTRl 


40 kD 


S. cerevisiae 


3 


Btkp 


55 kD 


H. sapiens 


3 


XklpSA 


62 kD 


X. laevis 


00 


Xklp3B 


40 kD 


X. laevis 


9 


Tep3 


70 kD 


A. gambiae 


4 


E8R 


32 kD 


Vaccinia virus 


0 



Table 5. Chaperone-dependent yield improvement of soluble target proteins. Clones 
corresponding to the target proteins were co-transformed with the different chaperone 
combinations and cultured according to the best among the conditions reported in Figure 
17. The improvement factor enabled by chaperone co-expression indicates the ratio 



10 between the highest yield of soluble target protein obtained using cells co-transformed with 
chaperones and its amount recovered from cells not hosting recombinant chaperones; the 
symbol oo means that no soluble target protein was expressed in absence of chaperones. 
The target proteins were expressed in arabinose-regulated pBAD vectors and the different 
chaperone combinations listed in material and methods were induced by IPTG addition. 

15 

The optimal chaperone combination (Fig. 17 A) and the e3q)ression conditions were specific 
for each target protein. The complexity of the interactions among the different recombinant 
proteins is illustrated in the experiments summarised in Figure 17B and 17C. Soluble 
GTRl accumulation was induced at a similar level by both 0.5 and 1 .5 mg/mL of arabinose 
20 (Fig. 17B, lanes 1 and 2). The co-expression of low amoimts of K+J+E+ClpB+GroELS 
chaperones induced by 0.02 mM IPTG stimulated the accumulation of soluble GTRl 
whose expression was induced by 0.5 mg/mL of arabinose (Fig. 17B, lane 4). 
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Nevertheless, the amount of the soluble target protein decreased if EPTG-dependent 
chaperones were allowed to accumulate before the arabinose-dependent induction of GTRl 
(Fig. 17B, compare lanes 4 and 5). The same pattern of inhibition of soluble GTRl was 
observed when higher chaperone expression was induced by ten-fold higher IPTG 

5 concentration but, in such a case, the absolute amount of soluble GTRl was strongly 
reduced (lanes 6 and 7). These data confirm the existence of a competition among the 
products of different recombinant plasmids. In this case, both plasmids use the cell KNA 
polymerase and, therefore, it is not possible to distinguish between competition at the 
transcription or translation level. Nevertheless, the accumulation of soluble GTRl induced 

10 at low level of arabinose is progressively inhibited by an increasing amount of available 
chaperones (Fig. 17B). The inhibitory chaperone accumulation was obtained with both 
higher IPTG concentration and longer time of induction before the arabinose-dependent 
induction of GTRl. When we repeated the same experiments using 1.5 mg/mL of 
arabinose to induce a higher GTRl expression the results were reversed (Fig. 17B, 

15 compare lanes 8-11 and 4-7). As a matter of fact the higher arabinose concentration 
enabled a strong accumulation of GTRl; however, the increasing amounts of expressed 
chaperones did not reach a level critical for competition but could provide a more 
stabilising environment for GTRl. 

The conclusions firom this work are that chaperones can positively contribute to GTRl 
20 accumulation. Nevertheless, a ratio among the transcripts seems to be important for 
avoiding detrimental competition at the translation level. The parameters involved are the 
rate of induction of both chaperone and target genes and the time in which chaperones can 
accumiilate before the target protein is induced. 

Recently, it has been showed that recombinant proteins precipitated in aggregates could be 
25 re-solubilised in vivo. Aggregate re-folding was induced after that translation inhibition 
made available foldases and chaperones otherwise employed in metabolic folding (Carrid 
and Villaverde, 2001, FEES Letts. 489, 29-33). We applied this idea to our system in 
which the accumulation of recombinant chaperones was possible. 

The expression of coiled-coil Xklp3B was induced overnight at 0.5 mg/mL of arabinose 
30 (Fig 17C, lane 1). The amount of recovered soluble protein was low and inhibited or 
almost completely prevented when chaperones (K+J+E+GroELS+ClpB) expression was 
IPTG-induced together or before arabmose addition (Fig. 17C, lanes 2 and 3). These data 
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confinn the results collected using GTRl and explained considering a competition among 
the recombinant proteins (Fig. 17B). In contrast, the removal of the arabinose-containing 
medium and the addition of fresh medium plus chloramphenicol had a positive effect on 
the amoimt of recovered soluble protein (Fig. 17C, lane 4). Apparently, the standard 
5 cellular folding machinery is, therefore, sufficient to partially re-fold the aggregated 
recombinant target protein. Nevertheless, a strong re-solubiUsation improvement of the 
target protein was observed only when arabinose was removed, the pellet was re- 
suspended in fresh medhim and chaperone-expression was induced by 0.2 mM IPTG 
addition (Fig. 17C, lane 5). A similar improvement at a sUghdy lower extent was obtained 
10 by the simple addition of a sufficiently high amount of IPTG (0.2 m^4) to the arabmose- 
containing medium (Fig. 17C, lane 6). Therefore, it seems that it is possible to exploit the 
inhibitory effect of an overwhelming chaperone exjwession on the arabinose-regulated 
target protein to switch the system from Xklp3B to chaperone expression. Then, in 
conditions that inhibit the further expression of Xldp3B (comments to Fig. 17B), the 
1 5 available chaperones induce the re-folding of the akeady aggregated target protein without 
the need to remove the arabinose from the medium. 

The collected results provide new information concerning the co-transformation of more 
than one recombinant proteins and confirm that chaperone co-transformation can increase 
the amount of soluble target protein. They also indicate that interactions among 

20 transformed plasmids and among corresponding proteins need to find an equiUbrium in the 
host ceU to optimise the co-transformation benefit. In fact, it seems that chaperones can 
somehow compete with the target protein, meaning that some care is required to optimise 
each candidate system, although this is well within the ambit of the skiUed worker. 
Nevertheless, the reciprocal expression inhibition between target protein and chaperones 

25 can be exploited to tune the expression rate and improve the amount of soluble target 
protem. We must only be aware that the conditions need to be optimised since the 
accumulation rate is specific for each recombinant protein. 



